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0.  Abstract 

We  give  a  new  randomized  parallel  RAM  algorithm  for  finding  a  spanning 
forest  of  an  undirected  graph  in  logarithmic  time.  These  time  bounds  hold 
with  arbitrary  high  probability  for  any  input  graph  (i.e. ,  we  do  not  assume 
random  input;  these  bounds  hold  for  the  worst  case  input  graph) .  This  result 
assumes  a  parallel  RAM  model  which  allows  both  concurrent  writes  and  con¬ 
current  reads. 

Furthermore,  we  show  that  if  the  graph  is  not  very  sparse  (i.e.,  if 
the  number  of  edges  is  at  least  a  logarithmic  squared  factor  more  than  the 
number  of  vertices)  than  we  can  achieve  a  linear  processor  •  time  product 
(even  for  logarithmic  time  bounds)  for  finding  a  spanning  tree — which  is 
optimal  for  the  parallel  RAM  model.  Furthermore,  we  can  also  achieve  a 
linear  processor,  time  product  for  even  sparser  graphs  with  only  slight  time 
increase .  ' 
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1.  Introduction 


The  performance  of  a  parallel  algorithm  can  be  specified  by  bounds  on 
its  principle  resources:  time  and  processors.  For  most  nontrivial  graph 
problems,  the  product  ><Tof  the  number  of  processors  P  and  the  execution 
time  T  is  lower  bounded  by  Q(n+m) ,  where  n,m  are  the  number  of  vertices 
and  edges,  respectively  of  the  input  graph  in  adjacency  list  representation. 
Thus  for  these  graph  problems,  an  algorithm  is  optimal  if  P-T=0(n+m).  Of 
course  if  we  have  an  optimal  algorithm  with  time  T,  then  we  also  have 
(by  the  obvious  processor  simulation)  optimal  algorithms  for  any  time  bound 
T',  where  T^T'<0(n+m). 

The  depth  first  search  algorithm  of  [Tar j an,  72]  was  very  successful  in 

producing  optimal  sequential  RAM  algorithms  for  a  large  number  of  graph 

problems  such  as  undirected  graph  connectivity  and  biconnectivity  [Hopcroft 

and  Tar j an,  73].  No  optimal  graph  searching  method  has  been  proposed  for 

parallel  RAM,  for  poly log  time  bounds,  except  in  the  special  case  where  the 

2 

graph  is  extremely  dense  (i.e.  ,  m=ft(n  )). 

Previous  work  in  parallel  RAM  algorithms  has  yielded  few  optimal  results 

2 

for  graph  connectivity.  [Chin,  Lam  and  Chen,  82]  give  0( log  n)  time  con- 
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nectivity  algorithms  requiring  (n  +m)/(log  n)  processors,  which  is  optimal 
2  + 

only  if  m=ft(n  ).  [Reif,  82a]  gives  a  O(log  n)  '  time  connectivity  algo¬ 
rithm  for  undirected  graphs  using  randomization  (see  also  [Reif,  82b]  for  a 

2 

description  of  randomized  P-RAMs) ,  but  requiring  n  m  processors.  All  these 
algorithms  assumed  the  parallel  RAM  model  generally  known  as  the  P-RAM,  (see 


%ote:  We  use  the  notation  T(n)=0(f(n))  if  Vot>l3c>0  such  that 
Prob(T(n)  ^c  f  (n) ) >  1-n  a  for  sufficiently  large  n. 


[Fortune  and  Wyllie,  78])  where  concurrent  writes  on  the  same  memory  cell  are 
disallowed. 


The  WP-RAM  (see  [Shiloach  and  Vishkin,  81])  is  a  stronger  parallel  RAM 
model  where  concurrent  writes  on  the  same  memory  cell  are  allowed,  and  re¬ 
solved  arbitrarily.  This  machine  model  has  the  advantage  that  graph  problems 
appear  to  be  easier  to  solve  on  this  model.  The  P-RAM  can  simulate  the  WP- 
RAM  with  only  logarithmic  factor  of  time  increase  and  the  same  number  of 
processors,  by  use  of  the  parallel  sorting  algorithm  of  [Reischuk,  81]. 

In  particular,  [Shiloach  and  Vishkin,  82]  give  an  O(log  n)  time, 

0(n+m)  processor  WP-RAM  algorithm  for  undirected  graph  connectivity  and 
[Tarjan  and  Vishkin,  83]  recently  extended  this  result  to  bi connectivity . 
These  algorithms  have  processor,  time  product  f2(  (m+n)  log  n) ,  which  is  a 
logarithmic  factor  more  than  optimal.  [Vishkin,  84]  gives  an  almost  optimal 
n  processor,  and  O(log  n  log  n)  time  bound  for  finding  the  number  of 
successors  on  a  linear  list  of  length  n.  Vishkin  conjectured  that  random¬ 
ized  techniques  would  also  be  needed  to  get  optimal  parallel  graph  connec¬ 
tivity  algorithms. 


1 .  Organization  of  our  Paper  and  Statement  of  Results 

In  Section  2,  we  give  our  randomized  WP-RAM  algorithm  for  finding  a 
spanning  forest  of  a  graph  in  simultaneously  6 (log  n)  time  and  n+m 
processors.  The  randomized  algorithm  is  surprisingly  simple  (in  fact  con¬ 
siderably  simpler  than  previously  known  algorithms) ,  and  so  seems  to  be 
useful  in  practice. 

In  Sections  3  and  4  we  describe  some  modifications  of  our  algorithms 
which  results  in  an  optimal  processor • time  product  of  0(m+n) .  In  particular 
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Section  3  gives  an  0{log  n)  time  and  m/log  n  processor  algorithm  for  the 

2 

case  the  number  of  edges  m  is  at  least  n(log  n)  .  Furthermore,  Section  4 
gives  an  O(log  n  loglog  n)  time  and  m/(log  n  loglog  n)  processor  algorithm 
for  the  case  the  number  of  edges  m  is  at  least  n  logn  loglog  n. 

In  Section  5  we  give  as  an  interesting  application  of  these  techniques 
an  optimal  parallel  algorithm  for  finding  the  biconnected  components  of  any 
undirected  graph.  This  algorithm  has  the  same  complexity  bounds  as  our 
parallel  spanning  forest  algorithms.  In  the  full  paper  we  give  applications 
to  finding  minimum  spanning  trees  and  Euler  cycles. 


2 .  A  Randomized  Parallel  Algorithm  for  Spanning  Forests 

Let  G  =  (V ,E)  be  an  undirected  graph  with  vertex  set  V  =  {l,...,n} 
and  undirected  edge  set  E  of  size  m.  Let  its  directed  edges  be 
D(E)  =  { (u,v) | {u,v}  € e}  U  { (v,u) | {u,v}  £e}.  We  shall  associate  a  distinct 
processor  with  each  vertex  and  edge  of  G.  For  each  vertex  v£V,  we  shall 
have  integer  variables  R(v) ,  sex(v) ,  and  link(v),  and  for  each  directed 
edge  (u,v)  £D(E)  we  have  a  boolean  variable  tree-edge(u,v) .  The 
following  algorithm  is  to  be  executed  by  a  WP-RAM  as  defined  in  Section  1. 
We  assume  that  each  processor  v£V  is  provided  with  an  independent  random 
bit  generator  RANDv(0,l). 

Algorithm  RANDOM-MATE 
Input  graph  G  = (V,E) 

Initialization : 
begin 

for  each  v£V  in  parallel  do  R(v)  «-v 

for  each  (u,v)  £d(E)  in  parallel  do  tree-edge  (u,v)  •‘-false 


end 


i  n  Body : 


L'.’it  i_l  (R (u)  =  R  (v) )  for  all  (u,v)  £D(E)  do 
begin 

for  each  v  £v  in  parallel  such  that  R(v)  =v  do 
if  RAND^  (0,1)  =1  then  sex(v)  ■*-  female 
else  sex(v)  ^male 

f each  (u,v)  €d(E)  in  parallel  do  mate(u,v) 
fy£  each  v€v  in  parallel  do  R(v)  ■‘-R(R(v)) 
end 

Pl  anning  forest  F  =  {  {u  ,v}  6  E  |  tree-edge  (u,v)  } 

We  define: 

I loeouure  mate(u,v) : 

iex  (R(u) )  =  female  and  sex(R(v))  =  male  then 
begin 

attempt  to  assign  link(R(u))  -<-(u,v) 
if  link(R(u))  =  (u,v)  then 
begin 

t  ree-edge  (u,v)  true 
R(R(u) )  ■*-  R (v) 
end 

end 


•’-ii'-ndix  I  proves: 

IBtGREM  1.  The  total  number  of  parallel  steps  executed  by  random-mate  is 
0 ( log  n)  (Note:  see  the  footnote  in  the  Introduction  for  the  definition 
of  our  0 (  )  notation.) 
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3.  An  Optimal  Algorithm  for  at  Least  n(log  n)  Edges 

2 

Let  G  =  (V,E)  be  an  undirected  graph  with  n  vertices  and  m^ndog  n) 
edges  in  adjacency  list  representation  (we  use  a  vector  for  each  adjacency 
list) . 

Our  modified  algorithm  RANDOM-MATE'  will  first  reduce  (with  high  likeli¬ 
hood)  the  number  of  edges  and  nodes  to  m/log  n.  This  is  done  by  modifying 
RANDOM-MATE  to  execute  its  Main  Body  exactly  dQ 'log  n1  times,  where  dQ  is 
a  constant.  We  shall  assign  a  set  of  d^'log  n1  distinct  processors  for 

each  vertex  v  €V  where  d^  ^ 2  is  a  constant.  Each  processor  p  £ 
independently  chooses  a  random  list  E(p)  of  dQ 'log  n1  directed  edges  in  D(E) 
(not  necessarily  distinct)  departing  vertex  v. 

In  the  Main  Body  of  the  resulting  procedure  RANDOM- MATE'  is  the  same 
as  that  of  RANDOM-MATE  except  that  the  statement 

"for  each  (u,v)  £d(E)  in  parallel  do  mate(u,v)" 
is  replaced  with 

"for  each  v  €V  and  p  £Pv  in  parallel  do 
begin 

choose  and  delete  the  first  edge  e  from  E(^) 

mate (e) 
end" 

After  the  dQrlog  n1  iterations  of  this  modified  Main  Body,  RANDOM-MATE' 
deletes  each  "loop"  edge  (u,v)  6E  such  that  R(n)  =R(v)  using  time 
O(log  n)  and  m/log  n  processors.  With  high  likelihood,  the  resulting  graph 
is  of  size  m/log  n.  The  final  stage  of  RANDOM-MATE'  is  to  apply  the  original 
algorithm  RANDOM-MATE  to  this  resulting  graph,  using  m/log  n  processors. 
Appendix  II  proves : 

THEOREM  2.  The  total  number  of  parallel  steps  executed  by  random-mate' 
is  '(log  n)  using  d^/log  n  processors. 
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COROLLARY  2.  For  all  T  such  that  log  n<T^n+m,  we  find  a  spanning  tree 

in  time  O(T)  using  an  optimal  number  of  processors  p  =m/T  assuming 
2 

m  >n  (log  n)  . 


k .  An  Optimal  Parallel  Algorithm  for  at  Least  n  log  n  loglog  n  Ecices 

Let  G  ■  (V,E)  again  be  an  undirected  graph  with  V  =  (l,...,n}  and 
m=|E|  edges  which  we  assume  are  in  adjacency  list  representation  (when  each 
adjacency  list  is  given  by  a  vector).  To  obtain  optimal  processor  bounds  for 
the  case  m^n  log  n  loglog  n,  we  assign  a  processor  p  to  each 
rlog  n  loglog  n1  distinct  consecutive  directed  edges  in  the  adjacency  list 
of  each  vertex  v  £v.  We  name  these  processors  by  distinct  numbers  in 
P={l, —  f(n+m)/(logn  loglog  n)1};  so  for  each  p£P,  E(p)  initially  con¬ 
tains  at  most  log  n  loglog  n  edges  departing  the  same  vertex,  and 
D (E)  ='-'p€p  E(p). 

Algorithm  RANDOM-MATE" 

Input  graph  G  =  (V,E) 

Initialization 
begin 
n  ■*-  1  v  | 

m  «-  |e| 

p  *■  { 1 , . . .  '  (m+n)/  ( log  n  loglog  n)  1 } 
for  each  v  £v  in  parallel  do  R(v)  -*-v 
for  each  p€P  in  parallel  do 
begin 

construct  E(p)  as  described  above 

for  each  (u,v)  £E(p)  do  tree-edge (u ,v)  » false 
end 


end 


Outer  Loop: 


for  j  =  1, . . .  , f log log  n1  do 
begin 

Inner  loop : 

for  i  = 1 , . . . rlog  n1  do 
begin 

for  each  v  £V  in  parallel  such  that  R(v)  =v  do 
if  RAND^ (0,1)  =1  then  sex(v)  *■  female 
else  sex(v)  Tnale 
for  each  p  £P  in  parallel  do 
begin 

choose  a  random  (u,v)  fc"E  (p) 

mate (u ,v) 
end 

for  each  v£V  in  parallel  do  R(v)  ♦•R(R(v) ) 
end 

for  each  p  £P  in  parallel 

do  delete  each  edge  (u,v)  €E(p)  such  that  R(u)  =R(v) 

end 

Execute  until  termination  Main  Body  of  RANDOM-MATE 

Output  Spanning  forest  F  ={ {u ,v}  € E j tree-edge (u ,v)  } 

(Note  the  constants  c^,  c 2  are  chosen  so  as  to  achieve  any  given 
likelihood  of  success  as  bounded  in  Lemma  8.) 

In  Appendix  III  we  prove: 

THEOREM  3.  RANDOM-MATE"  takes  6 (log  n  loglog  n)  steps  using 
(m+n) / (log  n  loglog  n)  processors. 


COROLLARY  3.  For  all  T  such  that  log  n  loglog  n^T^n+m,  we  can  find  a 
spanning  tree  in  time  o(T)  using  an  optimal  number  of  processors  P=m/T, 
assuming  m>n  log  n  loglog  n. 


5-  Optimal  Parallel  Computation  of  Biconnected  Components 

Let  G=  (V,E)  be  an  undirected  graph  with  n  =  j v J  vertices  and  m  =  |e 
edges.  In  Appendix  IV  we  show 

THEOREM  k.  The  biconnected  components  of  G  can  be  computed  in  time  o(T) 

.  2 
using  p  =  m/T  processors ,  in  the  case  evlr.er  (T^log  n  and  m^n(log  n)  ) 

or  (T  ^log  n  loglog  n  and  m  >n  log  n  loglog  n) . 

The  key  idea  is  to  reduce  the  problem  (by  repeated  use  of  our  optimal 

graph  connectivity  algorithms)  to  computing  the  biconnected  components  of  a 

smaller  graph  G"  with  only  0(n)  edges  and  vertices. 
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APPENDIX  I.  Proof  of  RANDOM-HATE 

LEMMA  1.  On  termination 3  F  =  { (u,v)  € e| tree-edge (u,v) }  is  a  spanning  forest 

of  G. 

Proof.  We  use  an  induction  argument  (easily  seen  to  hold  for  the  first 
iteration)  that  for  just  before  iteration  i=l,2,...  of  the  until  loop, 

(1)  F^  = {{u,v)  €E|tree-edge (u,v) }  is  a  forest,  and 

(2)  for  all  u,v€v,  R(u)  =  R(v)  iff  u,v  are  in  the  same  tree  in 

F. 

l 

(3)  for  all  v  €V,  R(R(v))  =  R(v) . 

Let  be  the  tree  in  F^  with  root  w=R(w).  (By  the  induction 

hypothesis,  there  is  some  such  w) .  Suppose  link(w)  is  assigned  some 

edge  (u,v).  Then  sex< R(u))  /  sex(R(v)>  and  hence  w  =  R(u)  /  R(v). 

This  implies  by  the  induction  hypothesis  that  v  is  in  a  tree  TR(V)  of 

F.  distinct  from  T  .  observe  that  {u,v}  is  the  only  edge  departing  or 
i  w 

entering  Tw  for  which  tree-edge (u,v)  is  assigned  true  on  the  i-th 
iteration.  Hence  F^+^  must  be  a  forest.  At  the  last  step  of  the  i-th 
iteration  each  vertex  of  has  R(v)  assigned  R(v) ,  establishing  the 

induction  hypothesis  for  the  state  just  before  the  i+1  iteration.  o 

A  maximal  tree  spans  an  entire  connected  component  of  G. 

Let  ni  be  the  number  of  trees  in  F^  that  are  not  of  maximal  size. 

LEMMA  2.  If  n±  >  1,  Prob(ni+1  <  (3/^) n± )  >1/2. 

Proof.  Let  T  be  a  tree  of  F.  which  is  not  of  maximal  size.  Then 
-  w  1 

there  exists  at  least  one  edge  (u,v)  €D(E)  departing  T  to  a  distinct 

tree  T^vj  where  R(u)  *  w  f  R(v).  With  probability  1/4,  sex  (w)  =  female 
and  sex(R(v))  *  male.  Hence  with  probability  at  least  1/4,  T  is 
merged  into  some  other  tree  (not  necessarily  TR 


)  on  the  i-th  iteration. 
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In  the  worst  case,  each  nonmaximal  size  trer  T  in  F  has  only 

W  1 

one  such  departing  edge  (u,v)  where  w  f  R(v)  ,  and  each  targr*-  v  has 

a  distinct  R(v).  The  random  variable  n.  ,  is  upper  bm rd-d  ir.  thi.-. 

i+l 

worst  case  by  a  binomial  variable  B  ,  ,A  (which  is  the  r.ur.  r.f  n 

^,3/4  i 

independent  Bernoulli  variables,  each  with  success  probability  "’/•I'  • 

Observe  B  ,  has  mean  (3/4)n.  and  n.  ,  is  upper 
n^ ,3/4  l  i+l 

bounded  by  B  ,  ..  ,  so  Prob(n_,  ^(3/4)n.)  >  Prob(B  <(3/4'r.  ) 

n.,3/4'  i+l  i  n. ,3/4  i 

i  i 

>  1/2.  Q 

Our  time  analysis  will  utilize  the  following  probabi  1  i  st  i  <  in,. polity 
which  can  be  derived  (see  [Angluin  and  Valiant,  79])  from  the  b<‘t:\.V.  c-f 
[Chemoff ,  52]  and  [Hoeffding,  56]  . 

LEMMA  3.  Let  g  be  the  sum  of  N  independent  geometric  "■or'.:1-  y 

be  the  mean  of  g.  Then  for  all  a>0  there  exit*  *  a  a  c  >1  such  ‘■h-.t. 


Prob  (g  <c  u)  <  — — 
2  M 


for  N  sufficiently  . 


Let  iQ  =  1  and  inductively  for  k  =  l,2,...  let  i  be  the  nisi mum 

number  such  that  n.  ^(3/4)n,  .  By  Lemma  2,  each  i,  is  utp.-r  branded 

i,  l.  ,  k 

k  k-1 

by  an  independent  geometric  variable  with  expectation  7.  ri-  , 

ko 

n.  <1  for  k„  =  log.  ...  n,  1  =  1,  ,  i,  is  the  total  ru;rd  .-r  ,■  r  -  t .  ra¬ 
il,,,  0  ’4/3  k=l  k 

tions  of  the  until  loop  executed  until  termination.  The  <•  v  *  !  -  •  f 


I  is  O(log  n)  .  Each  execution  of  this  loop  takes  only  *  her 


of  steps.  Hence  by  Lemma  3,  we  have  Theorem  1. 


APPENDIX  II.  Proof  of  RANDOM-MATE1 


It  is  easy  to  show  that  Lemma  1  holds  and  further  more  its  induction 
hypothesis  also  holds.  Let  =  {(u,v)  € E J tree-edge (u,v)}  be  the 
forest  defined  on  the  i-th  iteration  of  the  modified  Main  Body  of  RANDOM- 
MATE'.  Let  a  i-loop  be  on  edge  which  on  the  i-th  iteration  departs  and 
enters  the  same  tree  of  F!.  Let  a  tree  T  of  F!  be  sem -active  if  at 

X  W  X 

least  1/log  n  of  the  set  of  edges  departing  Tw  are  not  i- loops.  Let  n 
be  the  number  of  semi-active  trees  of  F^. 


LEMMA 


5.  If  ^>1,  Prob(n^+1<(l-l/4e)np  >1/2. 


Proof.  Let  T  be  a  semiactive  tree  of  F!  which  is  not  of  maximal  size 
-  w  i 


with  root  w=r(w).  By  the  pigeon-hole  principal,  there  are  at  least  one 

vertex  v  of  T  where  R(v)  =w  and  at  least  1/log  n  of  its  departing 
w 

edges  are  not  i-loops.  Let  (u,v^)  ,  (u,v^  be  the  edges  chosen 


by  the  d^^  log  n  processors  of  Pv  on  the  i-th  iteration.  For  each 
j  *l,...,d  log  n,  with  independent  probability  at  least  1/log  n,  R(u)  / 


R(Vj).  (Note:  the  independence  is  due  to  the  fact  that  the  processors 


initially  choose  random  edges  independently  of  each  other.)  Further¬ 
more,  if  R(u)  y  R(v.)  then,  Prob(sex(R(u) )  =  female  and  sex(R(v.))  = 
male)  =  1/4.  Since  d,  >2,  the  probability  that  T  does  not  merge  into 
some  other  tree  on  the  i-th  iteration  is  at  most  (l/4)(l-l/log  n)109  n 
<(l/4)e  \  An  argument  similar  to  Lemma  2  then  shows  that  n|+^  *s  uPPer 
bounded  by  a  binomial  with  expectation  (l-l/4e)n^  and  hence  Prob(n^+^ 


<  (l-l/4e)n|) )  >  1/2. 


Recall  that  dQflog  n1  is  the  number  of  iterations  of  the  Main  Body 


Of  RANDOM-MATE' . 
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LEMMA  6.  Va>0  3d„  >1  suck  that  tnere  are  no  serriactive  trees  in  F ,, 

0  d  riog  n1 


with  •probability  of  least  l  -  l/n 


P roof .  Let  iQ  =  0  and  inductively  for  k=l,2,...  let  be  the 

minimum  number  such  that  nA  <  (l-l/4e)n!  .  By  Lemma  5,  each  is  bounded 

*  ak-l 

by  an  independent  geometric  variable  with  expectation  2.  Let  k  be  the 

kl 

maximum  number  such  that  ^  ^  d^rlog  n1 .  Now  suppose  there  is  an 

ct 

active  tree  in  F'  ,  .  with  probability  more  than  l/n  .  Then  the 

k  Vlogn1 

bound  on  i^  given  by  Lemma  3  is  violated,  a  contradiction.  c 

Since  each  iteration  of  the  Main  Body  of  RANDOM-MATE'  takes  only 

.  Ol 

constant  time.  Lemma  6  implies  that  with  probability  at  least  1-1/n  , 

the  number  of  the  "nonloop  edges"  after  execution  of  Main  Body  is 

at  most  m/log  n.  By  Theorem  1,  the  final  execution  of  RANDOM-MATE  on  this 

reduced  graph  takes  6 (log  n)  time  using  m/log  n  processors.  Hence  we 

have  proved  Theorem  2.  C 


» 
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APPENDIX  III.  Proof  of  RANDOM-MATE" 

Again,  it  is  easy  to  show  by  induction  that  Lemma  1  holds 

LEMMA  k.  On  termination ,  F  =  {{u,v}  €  e|  tree-edge  (u,v) }  -is  a  spanning 
tree  of  g. 

Furthermore,  the  induction  hypothesis  stated  in  the  proof  of  Lemma  1, 
also  holds  for  RANDOM- MATE" . 

On  the  j-th  iteration  of  the  Outer  Loop  and  the  i-th  iteration  of  the 
Inner  Loop, 

(1)  let  F.  .  =  {{u,v}  €E j tree-edge (u,v)  } 

i  .3 

(2)  let  E^  _.cd(E)  be  the  set  of  edges  remaining 

(3)  let  D.  . (T  )  c  e.  .  be  the  set  of  edges  departing  vertices  in 

*  t  J  W  1  9  D 

T  ,  where  T  .  is  a  tree  of  F.  . 
w  w  x 

(4)  let  a  tree  T  of  F.  .  be  active  if  at  least  half  cf  tne  edges 

w  i,3 

of  D.  .  (T  )  do  not  enter  a  vertex  of  T 

1,3  w  w 

(5)  let  n.  .  be  the  number  of  active  trees  in  F.  .  that  are 

i.3  i»3 

not  of  maximal  size. 

The  following  Lemma  is  similar  to  Lemma  b. 

LEMMA  7.  If  n.  .  >  1,  Prob(n.  ,  .  «*(7/8)n.  .)  >  1/2. 
i»3  1+1.3  1,3 

Proof.  Let  T  be  an  active  tree  of  F,  .  which  is  not  of  maximal  size, 
w  l  ,3 

and  containing  vertex  w  *  r(w).  Then,  since  T^  is  not  of  maximal  size, 

there  exists  at  least  one  edge  (u,v)  Ed.  . (T  )  entering  a  distinct  tree 

1,3  w 

Tr(vJ  where  w  y  R(v) .  Furthermore,  since  Tw  is  assumed  to  be  active, 

by  the  pigeon-hole  principal  for  some  p£P  such  that  E(p)  CD  .  (T  )  at 

-  1,3  w 

least  one  half  of  the  edges  (u,v)  CE(p)  have  R(v)  y  R(u).  Let  (u,v)  be 
any  edge  of  E(p).  With  probability  1/4,  sex(w)  = female  and 
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sex(R(v))  =  male  when  R(u)  =  w  and  R(v)  ?  w.  Furthermore,  if 

(u,v)  is  a  random  edge  of  E(p),  Prob(R(v)  /  R(u))  >  1/2.  Hence  with 

probability  of  least  1/8,  is  merged  into  some  other  tree  on  the  i,j 

iteration.  As  in  the  proof  of  Lemma  1,  we  observe  that  the  worst  case  is 

where  there  is  exactly  one  edge  (u,v)  € D,  . (T  )  such  that  R(v)  /  w,  and 

i  ,3  w 

when  each  R(v)  is  distinct.  In  this  worst  case,  n.  ,  .  is  upper  bounded 

i+l,3 

by  a  binomial  variable  B  which  is  the  sum  of  n.  .  independent  Bernoulli 

1,3 

variables,  each  with  success  probability  7/8.  But  since  the  mean  of  B  is 

(7/8) n.  .  and  B  upper  bounds  n.  _  Prob(n  ,  .  ^  (7/8) n  ) 

1*3  1+1,3  1+1,3  1,3 

>  Prob(B  <(7/8)n.  .)  >  1/2.  □ 

1,3 

Recall  that  c^flog  n1  is  the  number  of  iterations  in  Inner  Loop. 

LEMMA  8.  Va  > o  Slc^  >  1  such  that  there  are  no  active  trees  in 

F  ,,  ,  ,  .  with  probability  at  least  1  -  l/na. 

o^'log  n’+l,3  r  ° 


Proof.  Let  i  .  =  0  and  inductively  for  k=l,2,...  let  i,  .  be 
“  ° ,  3  k ,  3 

the  minimum  number  such  that  n.  ^(7/8)n.  .  By  Lemma  7,  each  i  . 

1k,j  Vl,j  ,J 

is  bounded  by  an  independent  geometric  variable  with  expectation  2.  Let 

k2 

k  be  the  maximum  number  such  that  I  i  .  ^  c  'log  n’.  Now  suppose 

there  is  an  active  tree  in  F  r,  ,  ,  .  with  probability  more  than 

k  c^log  n  +1,3 

1/n  .  Then  the  bound  on  if  i.  .  given  by  Lemma  3  is  violated,  a 

k=l  k ,3 

contradi ction. 


Since  each  iteration  of  the  Inner  loop  takes  only  constant  time,  each 
iteration  of  the  Outer  loop  takes  total  time  0(max^p  |  E (p)  |  +  log  n) . 

Lemma  8  implies  that  with  probability  at  least  1  -  l/na,  the  number  of  edges 
assigned  to  each  E(p).  decreases  by  at  least  a  factor  of  two  on  each  iteration 
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of  the  Outer  loop.  Furthermore,  the  number  of  vertices  also  decrease  by  at 
least  a  factor  of  two  on  each  iteration  of  the  Outer  loop.  Thus  the  total 
execution  time  of  the  'loglog  n’  iterations  of  the  Outer  loop  is 

0(maXp£p  | E (p) | (1  +  1/2  +  ...)  +  loglog  n  log  n)  <  O(log  n  loglog  n) 

with  probability  at  least  1  -  1/n  ,  and  hence  is  0(log  n  loglog  n) . 

After  completing  all  these  iterations,  the  size  of  the  graph  has 
decreased  to  (m+n)/log  n  with  probability  1  -  l/nQ  We  then  can 
apply  Theorem  1  to  bound  the  execution  time  of  the  call  to  RANDOM-MATE 
(using  (m+n)/(log  n  loglog  n)  processors)  to  be  Odog  n  loglog  n)  . 

Thus  the  total  execution  time  of  RANDOM-MATE"  is  O(log  n  loglog  n)  using 
(n+m)/(log  n loglog  n)  processors,  proving  Theorem  3. 
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APPENDIX  IV-  Finding  Bi connected  Components 


Algorithm  BI CONNECT 


input  undirected  graph  G  =  (V,E) 


begin 


[1]  compute  a  spanning  forest  F  of  G 

[21  Root  each  tree  in  F  and  compute  its  preordering  and  the  number 

of  tree  descendants  of  each  vertex. 

[3]  Construct  a  forest  F'  derived  from  F  by  adding  a  new  induced 

vertex  vg  for  each  edge  e£F  and  in  place  of  edge  e  =  { °  ta  } 

substitute  edges  (u, ,v  }  and  (v  ,u_}. 

1  e  e  i 

[4]  Construct  a  graph  G’  =  (V',E*) 

where  V'  =  {v  |e  6f}  is  the  set  of  induced  vertices 

and  E'  =  {f  (u^  ,u2>  !  {u.^  ,u2 }  €  E-F} 

and  f(u, ,u_)  =  (v  ,v  }  where 
12  G1  e2 

if  u^,  u^  are  unrelated  in  F  (i.e.,  one  is  not  the  ancestor 
of  the  other)  then  e^,  €F  are  the  tree  edges  entering  u1»u2 
from  their  parents,  or  if  (without  loss  of  generality)  u^  is  the 
ancestor  of  u^  in  F,  then  e^EF  is  the  tree  edge  departing  u^  on 
the  tree  path  from  u^  to  u^  and  e^  £F  is  as  previously  described. 

[5]  Compute  the  set  C(G')  of  connected  components  of  G' . 

[6]  Construct  graph  G"  =  (V  UC(G') ,E")  from  F'  by  collapsing 
together  all  vertices  in  V'  which  are  the  same  connected  compo¬ 
nent  of  G'. 


[7]  Compute  the  set  B(G")  of  biconnected  components  of  G". 


[8]  Merge  together  all  biconnected  components  of  G"  connected  by 


articulation  points  in  C(G').  (To  do  this,  we  construct  a  graph 
G" '  *  (B(G"),E"'J  whose  vertices  are  the  biconnected  components 
of  G"  and  each  edge  E  E"'  connects  biconnected  compo¬ 

nents  B^jB^EBJG")  with  a  common  articulation  point  in  C(G’). 
Then  we  compute  the  connected  components  of  G" ' . ) 

[9]  For  each  edge  e  €e,  let  v be  its  induced  vertex  in  V',  let 
Cg  be  the  connected  component  in  C(G')  containing  vg,  let 

be  the  set  of  biconnected  components  in  B(G”)  with  articula 
tion  point  Cfi ,  let  Be  be  the  connected  component  of  G" 1 
containing  Sg. 

Output  B  for  each  e  €e. 


LEMMA 


7.  Velfe2€E,  are  in  the  same  biconnected  component  of  G 


iff  B  =  B 

el  e2 


Proof.  If  C  =  C  ,  then  we  can  find  a  path  of  tree  edges  in  F  from 
61  e2 


e^  to  e2 ,  and  also  a  disjoint  path  of  nontree  edges  in  E-F  from  e^ 


to  e^,  and  hence  e^,  e 2  are  in  the  same  biconnected  component  of  G 


(however,  the  reverse  is  not  necessarily  true) . 

Suppose  S  =  S  but  C  /  C  Any  biconnected  conponent  b€s 


is  thus  connected  in  G"  to  both  articulation  points  C  and  C 

el  e2 


Using  the  fact  that  B  is  biconnected,  we  can  find  two  disjoint  paths 


p  ,  p  in  G  between  edges  e' ,  e'  such  that  C  ,  =  C  and  C  ,  =  C 
12  12  ®1  ei  e2  e2 


But  since  C  ,  *  C  ,  we  can  find  disjoint  paths  p' ,  p'  in  G  from  e 

®1  ei 

to  e'  and  since  C  .  =  C  ,  we  can  find  disjoint  paths  p" ,  p"  in  G 
1  e2  e2  1  * 


from  ej  to  e^.  Moreover,  p^,  p^,  p^'  can  be  shown  to  be  disjoint  from 


p p  ,  p'l.  Hence  p' •  p  •  p"  and  pl*p2-p2  are  disjoint  paths  in  G 


from  and  e^ .  so  e^,  e 2  are  in  the  same  biconnected  component  (again, 

the  reverse  may  not  be  true) . 

Suppose  B  =  B  but  S  f  S  .  By  induction  on  the  minimum  length 
el  6  2  ei  e2 

of  paths  in  G" 1  from  S  to  S  ,  we  can  similarly  construct  two 

el  e2 

disjoint  paths  in  G  from  e^  to  e^.  Hence  e^,  e^  are  in  the  same 
biconnected  component. 

On  the  other  hand,  suppose  e,  e'  are  in  the  same  biconnected  compo¬ 
nent.  Then  there  is  a  simple  cycle  C  of  G  containing  both  e  and  e*. 

C  can  be  written  as  the  mod-two  sum  of  some  k  basis  cycles  C ,,..., C, 

lk 

with  respect  to  the  tree  of  F  which  is  the  spanning  tree  of  the  connected 
component  of  G  containing  e^  and  e^.  These  can  be  ordered  so  that  for 
each  i  >1,  has  at  least  one  edge,  say  e^ ,  in  common  with  C_.  from 
j  <i.  It  is  easy  to  verify  that  Bg  =  B^,  if  e,  e’  are  on  the  same  basis 
graph.  Suppose  they  are  on  the  distinct  basic  cycles. 


We  assume  an  induction  hypothesis  that  B  =  ...  =  B  for  some 

el  ei+l 

i,  1  <i^k.  Then  since  e.  is  in  both  C  and  C  for  some  j  <i, 

i  e.  e . 

i  3 

we  can  show  B  =  B  Thus  we  have  B  =  ...  =  B  .  But  e€c.  and 
e.  e.  e,  e,  i 

31  1  k 

e’  6C.  for  sane  i  and  j.  Hence  B  =  B  =  B  =  B  , .  □ 

3  e  e .  e .  e 

1  3 

The  spanning  forest  F  computed  in  step  [1] ,  and  the  connected  compo¬ 
nents  computed  in  steps  [5]  and  [8]  can  all  be  computed  using  our  randomized 
algorithms  RANDOM-MATE’  or  RAN DOM- MATE".  Since  F  and  G"  have  each  only 
0(n)  edges  and  vertices,  steps  [2]  and  [7]  can  be  computed  in  time  O(log  n) 
time  with  n  processors  by  the  results  of  [Tarjan  and  Vishkin,  83].  Note 
that  the  ancestor  tests  required  in  step  [4 }  can  be  done  (see  [Tarjan,  72]) 
in  constant  time  using  the  preordering  and  descendant  numbering  computed  in 
step  [2].  The  graph  constructions  in  steps  [3],  [4]  and  [8]  can  easily  be 
done  with  a  processor* time  product  0(m+n) .  Theorem  4  follows  from 


Theorems  2  and  3. 
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